Tag
3 articles
The ARC-AGI-3 benchmark challenges AI systems to match untrained human performance in interactive environments, with no frontier model achieving more than 1% success. The test strips away AI's typical advantages, exposing a gap in reasoning and adaptability.
Yann LeCun has raised $1 billion for his new startup AMI Labs, marking Europe's largest seed funding round ever. Investors are betting on his vision for AI beyond LLMs.
OpenAI releases GPT-5.4 Thinking System Card, introducing enhanced reasoning and decision-making capabilities that represent a significant advancement in AI development.